Defining Antonymy: A Corpus-based Study of Opposites Found by Lexico-syntactic Patterns

ثبت نشده
چکیده

Using small sets of adjectival seed antonym pairs, we automatically find patterns where these pairs co-occur in a large corpus of Dutch, and then use these patterns to extract new antonym pairs. Evaluation of extracted pairs by five human judges showed that automatic scores correlate with human evaluation and that pattern based methods can be used to extract new antonym pairs. The majority of extracted pairs were noun-noun pairs, contrary to expectations based on previous research. Additionally, the method identifies a subgroup of co-hyponyms that frequently function antonymously, and together with more traditional antonyms makes up a wider class of incompatibles, suggesting that antonymy is a diverse relation that includes pairs of different types and categories that are not captured by any single linguistic theory. Comparison with Dutch WordNet and an online Dutch dictionary shows that only a handful of extracted pairs are currently listed in these existing resources, emphasizing the usefulness of the project.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corpus Approaches to Antonymy

The word “antonymy” was coined in 1867 by CJ Smith to describe word-pairs commonly known as “opposites” such as hot/cold, girl/boy and buy/sell. Some linguists (e.g. Lyons (1977) and Cruse (1986)) apply the term “antonymy” restrictively and would only identify the first of these three pairs as being truly antonymous. However, this seems counter-intuitive in many respects, and this paper will us...

متن کامل

Pattern-Based Distinction of Paradigmatic Relations for German Nouns, Verbs, Adjectives

This paper implements a simple vector space model relying on lexico-syntactic patterns to distinguish between the paradigmatic relations synonymy, antonymy and hypernymy. Our study is performed across word classes, and models the lexical relations between German nouns, verbs and adjectives. Applying nearest-centroid classification to the relation vectors, we achieve a precision of 59.80%, which...

متن کامل

Automatic Extraction of Turkish Hypernym-Hyponym Pairs From Large Corpus

In this paper, we propose a fully automatic system for acquisition of hypernym/hyponymy relations from large corpus in Turkish Language. The method relies on both lexico-syntactic pattern and semantic similarity. Once the model has extracted the seeds by using patterns, it applies similarity based expansion in order to increase recall. For the expansion, several scoring functions within a boots...

متن کامل

Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes

Detecting the linguistic scope of negated and speculated information in text is an important Information Extraction task. This paper presents ScopeFinder, a linguistically motivated rule-based system for the detection of negation and speculation scopes. The system rule set consists of lexico-syntactic patterns automatically extracted from a corpus annotated with negation/speculation cues and th...

متن کامل

Using Web Data to Investigate Antonym Canonicity

In the literature, some researchers (e.g. Gross, Fischer and Miller 1989, Charles, Reed and Derryberry 1994) treat antonym pairs as either canonical (for example old/young, cold/hot and happy/sad) or non-canonical (aged/youthful, cool/hot, happy/miserable), while others assume or argue for a continuum between the two categories (e.g. Herrmann, Chaffin, Conti, Peters and Robbins 1979, Murphy 200...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010